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Abstract 

A novel approach to complex problems has been previously applied to graph classification and 
the graph equivalence problem. Here we apply it to the NP complete problem of finding the largest 
perfect clique within a graph G. 
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I. INTRODUCTION 



In a novel dynamical approach the n vertices of a graph G are mapped onto n physical 
points located initially at equal distances from each other forming a symmetric n simplex in 
n — 1 dimensions. Attractive/repulsive forces are introduced between pairs of points corre- 
sponding to connected/disconnected vertices in the original graph G. We then let the system 
evolve utilizing first order Aristotelian dynamics|]I], ^ |^. We tune the relative strength of 
repulsive and attractive forces to he v/n with v the average valency i.e. average number of 
vertices connected to a given vertex so as to have no net average repulsion/attractions. 

We found, that as the system evolves various physical clusters of points tend to form. 
These physical clusterings reveal clusters (or imperfect cliques) in the graph - namely groups 
of vertices with a larger than average mutual connectivity. Also the matrix of distances 
Rijit) between the various points rj(t) and rj{t) is characteristic of the graph topology: 
points corresponding to vertices which are "close in the graph" namely have (relatively) 
many, short, paths connecting them will move closer together and conversely, points which 
are "far in the graph" tend to move apart. 

The distance matrix and clusters are important graph diagnostics. In particular the first 
allows us to solve easily the graph equivalence problem namely to decide if two connectivity 
matrices Cij and Ciij correspond to the same topological graph and if they do to find the 
relabelling of vertices which makes C and Ci identical. 

These results are of considerable practical importance. Still neither of the above problems 
belongs in the special class "NP complete" problems. The latter consists of problems such as 
the travelling salesmen problem and the satisfiability problem for which a putative solution 
can be readily checked in polynomial time yet no polynomial solution method is presently 
known 0]. 

Many of these problems can be phrased in terms of graphs as the task of finding some 
specific graph g inside bigger graph G. Further, all these problems which superficially seem 
very different are at a basic level ,equally difficult: If a method of solution in polynomial 
time is found for one such problem then all the problems should be solvable in such time 
by essentially the same method. Conversely if we can prove that just one NPC problem 
necessarily require, non-polynomial time for its solution, the same holds for all of them. Two 
of us have recently conjectured^] that a new variant of our approach namely of dynamically 
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docking rigid simplexes s and S representing g and G can solve the "g inside G" problems. 
Here we wish to present the first concrete application of the original, point translation or 
single simplex distortion algorithm (SDA), to an NPC problems namely that of finding the 
largest perfect clique in G. 

To most clearly illustrate the essence of the problem we consider the "students in dorm" 
example used in the general description of the Clay institute prize offered for resolving 
P = NP problem|P|. We have = 400 students out of which we need to select n = 100 
which can live together in a dorm, subject to a very long list of mutual exclusions. This 
list states that student # 1 cannot be together with any one student from a specific set of 
say 200 other students, student student # 2 cannot be together with any one from another 
partially overlapping set with a comparable number of students etc. How can we pick up a 
set of 100 students such that any one is completely compatible with the other 99, and what 
is this set? Clearly this is a particular example of the general satisfiability problem where 
the conditions imposed are just "two body" exclusions. 

It is also a particular case of looking for a graph g inside G where g is a. perfect clique 
of vertices each of which is connected to all other members in the clique. We encode into 
G with = 400 vertices the various mutual exclusion constraints by not connecting with 
edges vertices Vi and Vj if student # i and student # j are not compatible, and connecting 
by edges compatible pairs. Clearly if we find within G a clique with n vertices it means, 
by our very construction, that the students to which these vertices correspond are indeed 
all mutually compatible. We could construct in judicious manner various smaller consistent 
subsets, and try piece them together. Often, however a new inconsistency is revealed and 
we need to pursue other alternatives. While we certainly can do this in far less steps than 



In desperation we might decide to resort to the following primitive alternative and simply 
let the 400 students "fight it out". In this all out war each student will try to push away 
members which are inconsistent with him and pull in those which are. This collective 
natural selection of the "compatible" - which may well be a prerelevant social phenomena 
- would hopefully leave us with the desired large group of mutually consistent individuals. 
Unfortunately the outcome of such a 400 way "Somo" fight of staying in the ring is strongly 
biased by the initial arbitrary placement of students in the two dimensional arena^]. Thus 




the difficulty of the problem seems to grow at least exponentially with n. 
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we could envision a situation where an ideal group of completely compatible dorm candidates 
is placed in the center of a group of highly unpopular ones and is "ejected" together with 
them. In order to generate the correct large clique we need to completely unbias the starting 
position and avoid the severe constraints due to our existence in a physical world with limited 
number of dimensions. This can be done only if we go to = — 1 dimensions and place 
the "students" which, in the inverse problem that we are really after, are metaphors for the 
physical points representing the N vertices of the graph, at the vertices of a symmetric 
simplex. 

II. SEARCHING FOR CLIQUES 

Our search for perfect cliques uses the same physically motivated dynamical algorithm 
previously developed to identify via the physically bunched points clusters or "imperfect 
chques" in a graph [|I|. We found that to adapt this algorithm for the present purpose we 
need only to enhance the ratio of the repulsive and attractive interactions. Originally it was 
chosen to be: 



which could be relatively small. Thus for an average valency of 10 in a graph with 100 
vertices it is only 0.1. However, in order to meet the criteria of perfect cliques we clearly 
have to significantly enhance the strength of the repulsive interactions so as to avoid points 
which are connected to a fairly large number of the points in the clique but not to ALL of 
them from joining in. Thus in the first round of applications we used 



We first considered a small clique of n = 7 in a graph with = 100 vertices with the 
connectivity matrix of Fig.(||). In addition to the clique this matrix consists of six clusters 
with randomly created internal connections with average valency 20%. These clusters, in 
turn, have been randomly interconnected with a large valency 10%. To simulate a real-life 
situation of networks with unknown structure (topology) we randomly permute the rows and 
columns of the matrix C obtaining the reshuffled matrix C shown in the Fig.(^. Next we 
apply our algorithm for clusters reconstruction using equal attractive and repulsive constant 
forces in n — 1 = 99 dimensional space. The vertices of the 100-simplex were allowed to move 



Urep{r)/Uau{r) = v/u, 



(1) 



Urep{r)/Uau{r) = 1. 



(2) 
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FIG. 1: Connectivity matrix C with 7x7 clique. 

under the influence of tlie forces on the 98-dimensional hyper-sphere in 99-dimensions. After 
a number of steps we analyzed the mutual distances between the vertices of the simplex and 
group neighbors which are close to each other into cliques. The new cluster-connectivity 
matrix is shown in Fig.(|]). We see that due to the large repulsive forces most vertices did 
not move close each to others. The only vertices grouped together are the ones that belong 
to the chque. 

To see how the algorithm works for the case of overlapping cliques we considered two 
examples. The first example includes two cliques 7x7 and 15 x 15 with a 2 x 2 overlap on a 
"background" of a 100 x 100 matrix with the same average 10% connectivity as above. The 
corresponding connectivity matrixes before reshuffling is shown in Fig.(^). The reconstructed 
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FIG. 2: Reshuffled connectivity matrix with C/ 7 x 7 chque. 

connectivity matrix for the cliques is shown in Fig.(^. The second example involves two of 
cliques 10 with a large overlap of 5 in Fig.(|^). Our reconstruction yields one clique of 10 and 
one of 5 Fig.(0). As expected we fully reconstruct the largest cliques. This is done at the 
expense of correspondingly reducing the size of the reconstructed part of the overlapping 
smaller or equal size cliques. 

Other examples involve a. n = 100 clique in a = 400 graph corresponding to the 
"students in dorm" question. In addition we had an imperfect clique or cluster of 300 with 
average valency of 20% on a background of 10% (Fig.(H)). The successfully reconstructed 100 
clique after reshuffling is shown in Fig.(^. We next increase the "background" of average 
valency to 90% for the 300 x 300 cluster and to 50% of that average interconnections in 
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FIG. 3: Clique connectivity matrix for reshuffled connectivity matrix C. 

Fig.(^). Fig. ([T^) shows the reconstructed results after reshuffling. The clique is imperfectly 
reconstructed. However, increasing Urep/Uatt from one to two improves the reconstruction 
as shown in Fig. (p!3|) . 

The above results are to our mind fairly impressive. These results show that by mild 
tuning our original code solves in very short time and for many cases the NPC problem of 
the largest clique. It may still fall short of solving it in all cases. As a worst case scenario 
we could envision a vertex ( or several such vertices) which are connected to all the vertices 
in the clique save one. To avoid these vertices from joining the clique even in this case, 
rendering it imperfect, we need that the single repulsion due to the missing edge, overcome 
all the n—1 attractions to the rest of the points in the clique . Thus the strict perfect clique 
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FIG. 4: Connectivity matrix C with 7x7 and 15 x 15 cliques, 
worst case scenario demands 

Urep > (n - 1) • Uatf (3) 

This wildly differs from the above eq.(refx): for a graph G with 100 vertices v — and a 
clique of size n = 10 we need a factor hundred enhancement of the ratio Urep/Uatt from 0.1 
to 10! 

Our 3 — d based intuition would strongly suggest that this stops formation of all cliques, 
perfect or not, since as any given point tries move towards its "Designated" clique it may be 
"Overwhelmed" by the many repulsive forces which will prevent it from joining the clique. 
The configuration with the perfect clique (and the largest perfect clique in particular) fully 
formed i.e having all its vertices collapse at a point is indeed the desired final lower energy 
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FIG. 5: Reconstructed clique connectivity matrix for C/ 7 x 7 and 15 x 15 cliques. 

state. However there may be false local minima which trap our system just like in spin 
glass and protein folding problem 

This is indeed most certainly the case for "low" dimensionalities. However with d = N—1, 
as is the case here, the above intuition fails. Specifically any one given "test point" feels 
just as many different forces in the directions of the other particles namely — 1 as there 
are independent directions d = N ~ 1 to move in. Ideally therefore the test particle should 
be able to simultaneously respond to all different — 1 forces, move in the direction of all 
the attractors and away from all the repellers and in the process further lower the energy of 
the system. We can adopt a local, non-orthogonal, system of coordinates where the — 1 
axes are aligned along the unit vectors pointing from r - the chosen point, to n, r2, . . . rAr_i 
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FIG. 6: Connectivity matrix C for two 10 x 10 overlapped cliques. 



the other N — 1 points. Using our choice of constant forces[§ we have then a net force 



N-l 



m = E 



i=l 



r — r,- 



r — r,- 



(4) 



which is the sum of the unit vectors along these axes with + and - signs. Since these are 
— 1 hnearly independent vectors the sum never vanishes > always and no local 

minimum arises. 

There is one "small" correction however to the above argument. It is due to the fact 
that in our original algorithm we have introduced one further constraint on the motion of 
the points, namely that at all times on the unit circle \ri{t)\ = 1. It seemed necessary 
in order to avoid running away to infinity of repelling vertices or collapse to the origin of 
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FIG. 7: Reconstructed clique connectivity matrix for two 10 x 10 overlapped cliques. 



attracting ones. This does however introduce an extra normal reaction force that could 
in fact cancel the above sum in eq.(^, and thus yields local minima. Hence in the final 
runs we did not impose this constraint. Instead we modified our code to facilitate handling 
the increasing distances between points at later stages of the evolution. We found that 
our program fully reconstructed the maximal clique |lO(l. This happens regardless of the 
degree of the connectivity of the random background and also of the existence of large and 
partially overlapping slightly smaller cliques. Thus for the n=100 maximal clique in an 
N=400 vertex graph (i.e the students choice for dorm problem) we added two 80x80 cliques 
which overlapped our 100x100 clique in two 60x60 patches which ,m turn, had a 20x20 
overlap and used a background with 70% connectivity Fig.(|l^). Even under such seemingly 
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unfavorable conditions we reconstructed our clique Fig. ([151). 
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FIG. 9: Reconstructed clique connectivity matrix for 100 x 100 clique. 
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FIG. 12: Reconstructed clique connectivity matrix for 100 x 100 clique with "heavy" background. 
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FIG. 13: Reconstructed clique connectivity matrix with doubled repulsive forces for 100 x 100 
clique with "heavy" background. 
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FIG. 14: Connectivity matrix with three overlapping chques and 70% random background. 
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FIG. 15: Reconstructed cliques for 400-matrix with 70% background connectivity. 
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